Unsupervised extraction of semantic relations (Extraction non supervisée de relations sémantiques lexicales) [in French]
نویسندگان
چکیده
This paper presents a knowledge base containing triples involving pairs of verbs associated with semantic or discourse relations. The relations in these triples are marked by discourse connectors between two adjacent instances of the verbs in the triple in the large French corpus, frWaC. We detail several measures that evaluate the relevance of the triples and the strength of their association. We use manual annotations to evaluate our method, and also study the coverage of our ressource with respect to the discourse annotated corpus Annodis. Our positive results show the potential impact of our ressource for discourse analysis tasks as well as semantically oriented tasks. Mots-clés : discours, sémantique, sémantique lexicale.
منابع مشابه
Semantic relation clustering for unsupervised information extraction (Regroupement sémantique de relations pour l'extraction d'information non supervisée) [in French]
Semantic relation clustering for unsupervised information extraction Most studies in unsupervised information extraction concentrate on the relation extraction and few work has been proposed on the organization of the extracted relations. We present in this paper a two-step clustering procedure to group semantically equivalent relations : a first step clusters relations with similar expressions...
متن کاملUnsupervised extraction of semantic relations using discourse information. (Extraction non supervisée de relations sémantiques par l'analyse du discours)
Natural language understanding often relies on common-sense reasoning, for which knowledge about semantic relations, especially between verbal predicates, may be required. This thesis addresses the challenge of using a distibutional method to automatically extract the necessary semantic information for common-sense inference. Typical associations between pairs of predicates and a targeted set o...
متن کاملExtraction et regroupement de relations entre entités pour l'extraction d'information non supervisée
This article takes place in the context of unsupervised information extraction in open domain and focuses on the extraction and the clustering at a large scale of relations between named entities without defining their type a priori. The extraction step combines the use of basic but efficient criteria and a filtering procedure based on machine learning. The clustering step organizes extracted r...
متن کاملMise en lumière de relations sémantiques pour la construction d'ontologie à partir de textes
Résumé : La construction d’ontologies à partir de textes consiste à décrire des concepts par leurs relations conceptuelles et éventuellement leurs instances, à partir des matériaux textuels (termes, relations lexicales). Cet article propose une méthode pour mettre en lumière, par l’analyse de corpus, des relations lexicales susceptibles de donner naissance à des relations conceptuelles. Cette m...
متن کاملUnsupervised selection of semantic relations for improving a distributional thesaurus (Sélection non supervisée de relations sémantiques pour améliorer un thésaurus distributionnel) [in French]
Unsupervised selection of semantic relations for improving a distributional thesaurus Work about distributional thesauri has shown that the relations in these thesauri are mainly reliable for high frequency words. In this article, we propose a method for improving such a thesaurus through its re-balancing in favor of low frequency words. This method is based on a bootstrapping mechanism : a set...
متن کامل